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(57) The invention relates to an apparatus (36) for 
sending data to an ATM network and receiving data from 
the ATM network comprising: 

(a) a bus interface (38) for interfacing with a bus 
supporting communication between a host, a stor- 
age device and the apparatus; 

(b) an ATM interface (40) for interfacing with the 
ATM network; 

(c) a transmission unit (50) for transmitting outgoing 
data from the bus interface to the ATM interface, the 
transmission unit including 

(1 ) a first RAM interface (58) for interfacing with 
RAM (42) being used as a buffer for buffering 
the outgoing data from the bus interface, 

(2) means for segmenting (70) the outgoing da- 
ta from the buffer into outgoing ATM cells, and 

(3) a traffic shaper (62), for controlling traffic of 
the outgoing ATM cells to the ATM interface in 
cooperation with the means for segmenting; 
and 

(d) a reception unit (80) for transmitting incoming 
data from the ATM interface to the bus interface, the 
reception unit including 

(1) means for performing VPI/VCI filtering (84) 
of incoming ATM cells, 

(2) means for reassembling (86) the incoming 
data using payload of the incoming ATM cells, 



and 

(3) a second RAM interface (94) for interfacing 
RAM (46) being used as a buffer for buffering 
the incoming data the means for reassembling. 
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Description 

The present invention relates to servers for data de- 
livery. Traditionel servers were designed with the ten- 
dency to be actively involved in the physical transmis- 
sion of data. For applications such as video on demand 
or karoake on demand, deliverance of a high number of 
digital video streams in-real time a'fe required. The'digital' 
video stream typically include video data compressed 
according to I SO/I EC 11172 or ISO/IEC 13818, which 
are commonly known as MPEG-1 standard and MPEG- 
2 standard respectively. 

An ATM (Asynchronous Transfer Mode)-base serv- 
er system with capabilities extending beyond mere data 
delivery has already been proposed in the European 
Patent Application No. 95.200819.1. 

A streaming engine for ATM communication config- 
ured as an ASIC (Application Specific Integrated Circuit) 
has been proposed in the international application WO 
96/08896 published under PCT 

A method for recording and reproducing com- 
pressed video data according to the MEPG standard is 
proposed in European Patent Application EP 0667 713 
A2. In this case, the compressed video data.is.recorded- 
in the disk in the special form including the scan infor- 
mation so that the specific compressed video data re- 
producing apparatus can achieve VCR functions (e.g. 
FF, FR). 

It is an object of the present invention to improve 
upon the above mentioned prior art and/or to provide a 
server for future applications. 

The present invention provides an apparatus for 
sending data to an ATM network and receiving data from 
the ATM network comprising: 

(a) a bus interface for interfacing with a bus sup- 
porting communication between a host, a storage 
device and the apparatus; 

(b) an ATM interface for interfacing with the ATM 
network; 

(c) a transmission unit for transmitting outgoing data 
from the bus interface to the ATM interface, the 
transmission unit including 

(1) a first RAM interface for interfacing with 
RAM being used as a buffer for buffering the 
outgoing data from the bus interface, 

(2) means for segmenting the outgoing data 
from the buffer into outgoing ATM cells, and 

(3) a traffic shaper, for controlling traffic of the 
outgoing ATM cells to the ATM interface in co- 
operation with the means for segmenting; and 

(d) a reception unit for transmitting incoming data 
from the ATM interface to the bus interface, the re- 
ception unit including 

(1 ) means for performing VPI/VCI filtering of in- 



coming ATM cells, 

(2) means for reassembling the incoming data 
using payload of the incoming ATM cells, and 

(3) a second RAM interface for interfacing RAM 
s being used as a buffer for buffering the incom- 
ing data the means for reassembling. 

" r " The apparatus according to the present invention 
provides for management of running applications that 

10 interact with a large n umber of clients and management 
modules distributed over a system, as well as manage- 
ment of the data that are delivered. The server accord- 
ing to the present invention provides time or processing 
power to run higher level management tasks, as the host 

*s is less actively involved in physical transmission of data. 
The hardware according to the present invention is able 
to deliver data in real time under different performance 
requirements and is well suited for such real time deliv- 
ery. The streaming engine according to the present in- 

20 vention is able to support simultaneous communications 
with many clients and to facilitate the video streaming 
task. The server according to the present invention, also 
provides for interoperability, such as to serve data to any 
- v^type of client-.nThe conteratflo be delivered, can be stored 

25 in a versatile form (i.e. raw or non formatted form) in the 
server according to the present invention. 

Preferred embodiment of the apparatus according 
to the present invention are described in the dependent 
subclaims. 

30 Further the present invention provides a system 
and method for serving a n umber of clients in a versatile 
way. 

Further advantages, features and details of the 
present invention will become clear when reading the 
3$ following description, in which reference is made to the 
annexed drawings, in which: 

figure 1 shows a general system architecture of an 
interactive communication system; 
40 figure 2 shows a detail block diagram of an embod- 
iment of the apparatus according to the present in- 
vention; 

figure 3 shows a block diagram of the Tx address 
translator of figure 2; 
45 figure 4 shows an example of the use of the address 
translator of figure 3; 

figure 5A, 5B, 5C show respective examples of ad- 
dress translation for TCP IP packetisation; 
figure 6 shows an example of use of the Tx rate 

50 block of figure 2; 

figure 7 shows the behaviour of a bit rate achieved 
by the traffic shaper of figure 2; 
figure 8 shows a diagram for explaning the sending 
of stream within one cell period; 

55 figure 9 shows the submission of cells for different 
classes; 

figure 10 is a block diagram of an architecture for 
the traffic shaper of figure 2; 
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figure 11 shows a block diagram of the command 
block of figure 2; 

figure 12 is a diagram lor explaning the operaiion 

of the byte swapper of figure 2; 

figure 13 shows a format of one ATM cell used in 

UNI; 

figure 14 shows a block diagram of the VPI/VCI 
translator of figure 2; *' 1 ; ' " >4 " : 
figure 15 shows a block diagram of the pattern de- 
tector of figure 2; and 

figure 16 shows an example of an address transla- 
tor of figure 2. 

Figure 1 shows a general system architecture of a 
preferred embodiment of an interactive communication 
system. This is a broad-band system that supports vir- 
tually any kind of interactive multi-media application. 
Particular attention is paid to real time multimedia deliv- 
ery mode applications. 

A server 10 functions as VOD (Video On Demand) 
server, KOD (Karaoke On Demand) server, and/or In- 
ternet server, etc. and communicates with STBs (Set 
Top Box) 18 as clients over a public network 16. The 
server 10 consists of a local ATM«switch 14 and several 
SMUs (Storage Medium Unit) 12 that are interconnect- 
ed thorough the local ATM switch 14. The main purpos- 
es of the local ATM switch 14 are to route data between 
the SMUs 1 2 (for instance, to duplicate a movie com- 
pressed according to the MPEG standard from one 
SMU to another), create a ATM-based LAN inside the 
server 10, and interface to the public network 16. Each 
SMU 12 communicates with the local ATM switch 14 at 
high speed, with current technology at e.g. a maximum 
of 622 Mbps. The public network 16 is optional and the 
server 10 may directly communicates with the STBs 18. 

Figure 2 shows a detail block diagram of the SMU 
12. The SMU 12 has storage devices 20, a host 2B and 
a streaming engine 36 as major units. These units are 
interconnected via a PCI (Peripheral Component Inter- 
connect) bus 24. A host CPU 30 and a host memory 32 
in the host 28 are connected via MIPS bus 34 in a con- 
ventional configuration. In this embodiment the MIPS 
bus 34 is connected to the PCI bus 24 thorough a PCf 
bridge 26. The host 28 is primarily intended for running 
applications like VOD, KOD, internet server that interact 
with clients or STBs. 

The storage devices 20 contains one or more 
strings of hard disks. These hard disks are connected 
via SCSI or Fibre Channel and store real time sensitive 
data like MPEG-2 encoded video streams and the con- 
tents of data packets like the body of TCP/I P (Transmis- 
sion Control Protocol/Internet Protocol) packets without 
the headers. 

The streaming engine 36 is preferably configured 
as a single ASIC (Application Specific Integrated Cir- 
cuit). The streaming engine 36 streams the real time 
sensitive data and the data packets. The streaming en- 
gine 36 has a transmission path 50 and a reception path 



80 as major parts and a PCI interface 38 and an inter- 
face 40. The transmission path 50 handles the outgoing 
data stream from the storage devices 20 and the host 
28 to the local ATM switch 14. The reception path 80 

s handles the incoming data stream from the local ATM 
switch 1 4 to the storage devices 20 and the host 28. The 
high speed connections and the independence of the 
transmission path and receptton path allow for 622Mbps 
simultaneously in both directions. 

10 The PCI interface 38 interfaces the PCI bus 24 with 
the transmission path 50 and the reception path 80. The 
PCI interface 38 transfers the outgoing data stream from 
the PCI bus 24 to a PCI FIFO 52 in the transmission 
path 50 and transfers the incoming data stream from a 

is PCI FIFO 98 in the reception path to the PCI bus 24. 
The interface 40 interfaces the transmission path 
50 and the reception path 80 with an external physical 
layer device (not shown) connected to the local ATM 
switch 1 4. The interface 40 can include two types of ATM 

20 interfaces according to the UTOPIA (Universal Test and 
Operation PHY Interface for ATM) level 2 standard. One 
is UTOPIA interface in 8 bit wide data path mode and 
the other is UTOPIA interface in 16 bit wide data path 
a:- mode. ' 

25 The transmission path 50 consist of several func- 
tional blocks which act together to perform high speed 
transmission. 

The first block in the transmission path 50 is the Tx 
address translator 54, which places the outgoing data 

30 stream from the PCI FIFO 52 into host-specified mem- 
ory locations of a stream buffer 44 allocated in an exter- 
nal RAM 42. This allows for controlled "scattering" of da- 
ta into non-contiguous memory, which is useful for an 
operation which to some extent resembles so-called 

36 RAID (Redundant Array of Inexpensive Disks) operation 
which ensures integrity of data streams and TCP/IP 
packetisation. 

The TCP/IP checksum block 56 provides hardware 
support for calculating TCP/IP checksums. Its function 

40 is to calculate and maintain a partial checksum for each 
packet until all data has been transferred. The TCP/IP 
checksum block 56 works together with the Tx address 
translator 54 to create TCP/IP packets directly in the 
stream buffer 44. The TCP/IP header and pay load of the 

45 packets are placed in the stream buffer 44 separately, 
passing through the checksum block 56 which keeps a 
partial checksum. As soon as all data is in the stream 
buffer 44, the checksum value is placed in the correct 
position of TCP/IP header, the packet is ready for trans- 

50 mission. 

The RAM interface 58 forms an interface between 
the external RAM 42 and the transmission path 50. The 
external RAM 42 may comprise dual ported SDRAM 
(Synchronous Dynamic RAM). This external RAM 42 in- 

55 dudes several stream buffers 44 to decouple the bursty 
data traffic from the disks of the storage devices 20 and 
provides the required constant bit rate data streams to 
the ATM-network 1 6. Each stream buffer handles one 
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outgoing data stream. In the contrast to the incoming 
direction, since the data flow characteristics in the out- 
going direction are fully predictable (controllable), the 
buffer requirements can be estimated beforehand. 
Therefore thu stream buffers 44 are .statically allocated 
in the external RAM 42. 

The Tx RAID or SDI (Stream Data Integrity) block 
60 provides support 'for data' redundancy. The Tx"ad- 
dress translator 54 places the data as needed in stream 
buffer 44. Then, as data is output from the stream buffer 
44, the Tx RAI D block 60 corrects error data in the event 
that one of the disks in the storage devices 20 breaks 
down. 

The traffic shaper 62 controls the streaming of the 
outgoing data from the stream buffers 44 to the ATM 
network 16. It is designed for very accurate rate pacing 
and low CDV (Cell Delay Variation). The traffic shaper 
62 consists of two main sections. One section handles 
high priority data such as video traffic, and the other sec- 
tion handles general data traffic of low priority. 

The command block 66 is intended to off-load the 
host especially 28 of real-time sensitive jobs. It performs 
actions triggered by the transmission of the content of 
rfexact known locations^nlhe outgoing data stream. 

The segmentation block 70 segments the outgoing 
data stream provided from the stream buffer 44 into 
AAL-5 PDUs (ATM Adaptation Layer -5 Protocol Data 
Units), and maps the AAL-5 PDUs into ATM cells. In 
case the outgoing data stream is MPEG-2 SPTS (Single 
Program Transport Stream), the segmentation block 70 
is able to segment two TS packets in the MPEG-2 SPTS 
to one AAL-5 PDU, unless there are less than two TS 
packets left in the MPEG-2 SPTS, in the latter case the 
AAL-5 PDU maps into eight ATM cells. In the general 
case, the AAL-5 segmentation is controlled by the PDU 
size which is programmable per stream. 

The reception path 80 has several blocks corre- 
sponding to the reverse operation of the blocks of trans- 
mission path 50. 

A VPI/VCI (Virtual Path Identifier/Virtual Channel 
Identifier) filtering block 84 performs fast and efficient 
VPI/VCI filtering of the incoming ATM cells. This is done 
by a combined hash and linear search functions over 
the entries in a VPI/VCI table. 

A reassembly block 86 basically performs the in- 
verse functions of the segmentation block 70. The reas- 
sembly block 86 reconstructs the AAL-5 PDUs using 
payload of the ATM cells, then maps the AAL-5 PDUs 
into the upper layer data (e.g., MPEG-2 SPTS, TCP/IP 
Packets). 

A TCP checksum verification block 88 verifies the 
TCP checksum in the TCP header if the incoming data 
stream is transmitted via TCP. 

A pattern detector 92 allows a limited number of bit 
patterns to be detected in an incoming data stream. A 
list is created, indicating exactly where the specified bit 
patterns occur in the stream. This supports certain 
processing tasks that can be performed on-the-fly 



whereas they would otherwise have to be done with 
post-processing. 

A Rx RAID or SDI block 90 adds redundancy to the 
incoming data stream. If a sequence of N words is writ- 
5 ten to a buffer (not shown), the parity over these N words 
is written next. This function can be turned on/off. If the 
incoming data stream will be stored in the storage de- 
vice'20"and transmitted later as s TCP/IP packets via the 
transmission path 50, the function is turned off. 
70 A RAM interface 94 is an interface between the re- 
ception path 80 and an external RAM 46. The external 
RAM 46 may comprise dual ported SDRAM. The exter- 
na! RAM 46 is used as several stream buffers 48 storing 
incoming data streams. Each stream buffer 48 handles 
*s one incoming data stream. Incoming data streams can 
have unpredictable properties. For instance, some of 
data packets can be very bursty. This means the re- 
quired buffer capacity varies from stream to stream and 
from time to time. Therefore, In the external RAM 46, a 
^0 dynamic buffer allocation is preferred. 

A Rx address translator 96 provides appropriate 
read addresses to the stream buffer 48. 

The details of the major blocks in the streaming en- 
g in e 36 are d escri bed be low. - 

25 

Tx Address Translator 

The outgoing data stream is provided from the stor- 
age device 20 to the streaming engine 36 in burst trans- 

30 mission over the PCI bus 24. The purpose of the Tx ad- 
dress translator 54 is to scatter one contiguous DMA 
burst in appropriate areas of the stream buffer 44. 

Figure 3 shows a block diagram of the Tx address 
translator 54. Before one contiguous DMA burst from 

55 the storage device 20 arrives, the correct starting ad- 
dress is written to a register 102 via a storage device 
controller 22. The content of the register 102 is used as 
write address for the stream buffer 44. A counter 106 
counts the number of bits of the outgoing data stream 

40 from the PCI Fl FO 52. Each time a data word consisting 
of 32 bits passes the counter 106, it inform a increment 
controller 104 that a word is transferred to the stream 
buffer 44. With each new word, the increment controller 
104 increments the content of the register 102 with 

45 ADDRESSJNCREMENT, which is a programmable 
value. In case of the outgoing data stream being RAID 
processed data, the value of ADDRESSJNCREMENT 
is basically set according to the number of disks used 
for RAID system. In case of the outgoing data stream 

so being payload of a TCP/IP packet, the value of 
ADDRESSJNCREMENT is basically set according to 
packetisation parameters. 

An address translation when the outgoing data 
stream is RAID processed data, is described below with 

55 reference to Figure 4. In this example, the RAID or SDI 
system consists of four disks Disk 0, Disk 1 , Disk 2 and 
Disk 3. The Disk 0 contains words 1 , 4, 7. . . to be trans- 
mitted to the local ATM switch 14. The Disk 1 also con- 



7 



EP 0 847 216 A1 



8 



tains words 2, 5, 8... to be transmitted to the local ATM 
switch 14. The Disk 2 also contains words 3, 6, 9... to 
be transmitted to the local ATM switch 1 4. The Disk 3 
contains parity words 0, 1 , 2... for error correction. Each 
parity word (e.g., parity 0) has been generated in the Rx 
RAID block 90 from three words (e.g., words 1, 2 and 
3) which constitute so-called stripe unit of RAID together 
with the parity word. '" ' ***••■■ 

In the event of failure in one of the disks (e.g., Disk 
2), one contiguous DMA burst including parity words is 
transferred to the Tx address translator 54. For ease of 
explanation, assume that the size of one contiguous 
DMA burst is 96 bytes (24 words), although the actual 
size can be larger than 100k bytes (depending on the 
speed of the hard- and/or software). In this case, the 
contiguous DMA burst 1 20 consists of words 1 , 4, 7, 1 0, 
13, 16 from the Disk 0, words 2, 5, 8, 11, 14, 17fromtha 
Disk 1, words 3, 6, 9, 12, 15, 18 from the Disk 2, and 
parity words 0, 1,2, 3, 4, 5 from the Disk 3. The Tx- 
address translator 54 generates the following sequence 
of addresses: 

178, 182, 186, 190, 194, 198 (data from DiskO) 

179, 183, 187, 191, 195^199 (data from Diskl) 

180, 184, 188, 192, 196, 200 (data from Disk 2) 

181, 185, 189, 193, 197, 204 (data from Disk 3) 

More specifically, before the contiguous DMA burst 
1 20 arrives at the stream buffer 44, a value 1 78 is stored 
in the register 102 as the starting address. Then the 
word 1 form Disk 0 is written in the address 178 in the 
stream buffer 44. When the word 1 passes the counter 
106, the increment controller 104 increments the value 
178 in the register 102 with ADDRESSJNCREMENT 
of a value 4 corresponding to the number of disks. Then 
the word 4 from the Disk 0 is written in the address 182 
in the stream buffer 44. When the word 4 passes the 
counter 106, the increment controller 104 increments 
the value 182 in the register 102 with 
ADDRESSJNCREMENT of a value 4. Then the word 7 
from the Disk 0 is written in the address 186 in the 
stream buffer 44. Similarly, remaining words 10, 13 and 
16 from Disk 0 are written in the addresses 190, 194, 
198 which are the number of disks apart in the stream 
buffer 44. 

When the word 16 from Disk 0 passes the counter 
106, the increment controller 104 increments the value 
198 in the register 102 with ADDRESSJNCREMENT 
of a value - 19. Then the word 2 from Disk 1 is written 
in the address 179 in the stream buffer 44. When the 
word 2 passes the counter 106, the increment controller 
104 increments the value 179 in the register 102 with 
ADDRESSJNCREMENT of a value 4. Then the word 5 
from Disk 1 is written in the address 183 in the stream 
buffer 44. When the word 5 passes the counter 106, the 
increment controller 104 increments the value 183 in the 
register 102 with ADDRESSJNCREMENT of a value 4. 
Then the word 8 from Disk 1 is written in the address 



187 in the stream buffer 44. Similarly, remaining words 
from Disk 1 are written in the addresses 191, 195, 199 
which are the number of disks apart in stream buffer 44. 
In the same way, words from the Disks 2 and 3 are 
s written in appropriate addresses in the stream buffer 44. 
The words written in the stream buffer 44 are read in 
liner fashion and provided to the Tx RAID block 60 to 
correct* errors." ' ' " 

When the outgoing data stream from the storage 

w device 20 is TCP/IP payload, the address translator 54 
and the TCP checksum calculation block 56 work close- 
ly together to provide support for TCP/IP packet gener- 
ation. The host 28 pre-programs the Tx address trans- 
lator 54 so that data is distributed according to a spec- 
is ified packet size. At first the host 28 needs to know all 
the packetisation parameters. Important parameters for 
this operation are TCP payload size, TCP header size, 
IP header size and IP payload size. TCP header and IP 
header basically have space for optional data but this is 

20 in practice not used. Therefore, a simplification can be 
introduced by assuming default sizes for the headers: 
TCP header size is 5 words (20 bytes) and IP header 
size is 5 words (20 bytes). 
*---»•• «■ The mechanism can be described as follows. 

25 The host 28 itself does a partial checksum calcula- 
tion over the pseudo-header of the TCP/IP header. Then 
it initializes a TCP checksum register 57 in the TCP 
checksum block 56 for that TCP/IP packet with this val- 
ue. Space for the stream buffer 44 also will be reserved 

30 in the external RAM 42 to fit the full TCP packet plus the 
TCP and IP header overhead. 

The host 28 will then instruct the increment control- 
ler 104 in theTx address translator 54 with the TCP pay- 
load size, TCP header size, IP header size and IP pay- 

36 load size. The TCP payload can then be sent as one 
contiguous DMA burst over the PCI bus 24 and placed 
into the area in the stream buffer 44 reserved for it by 
the Tx address translator 54, leaving space for the head- 
ers. As it goes from the- PCI bus 24 to the stream buffer 

40 44, the checksum calculation block 56 updates the par- 
tial checksum in the TCP checksum register 57. Note 
that with this method the payload, representing usually 
the bulk of the TCP/IP packets, does not need to be cop- 
ied first from the storage devices 20 to the host memory 

45 32 for processing it and then to the stream buffer 44. 
This saves valuable bus bandwidth and overhead for the 
host CPU 30. After the payload has been written, the 
header information, prepared by the host 28, is sent to 
the stream buffer 44 via the address translator 54. As 

50 with the payload, the Tx address translator 54 places 
the header in the previously reserved memory locations. 

This sequence can be reversed, whereby the head- 
er information is written first and the payload second 
In either case, when both the header and the pay- 

55 bad have been written, the TCP checksum will be com- 
plete and can be copied to the correct location automat- 
ically. 

This mechanism can also be used to efficiently sup- 
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port segmenting of a TCP packet into multiple smaller 
IP packets. In this case, space is reserved for each IP 
packet. The TCP packet data (header + pay load) is seg- 
mented into these packets and the header of each IP 
packet is wtilten by the hosl 28. 

All IP packets will be the same size except for the 
last block, which is likely to have a different size than 
"the others. The address translator 54 : iakes this in to ac 4 ^ 
count. After the complete TCP/IP packet(s) has been 
formed, it is ready for transmission. 

Figure 5A, 5B and 5C shows an example of address 
translation for TCP/I Ppacketisation. In this case, before 
the TCP/IP payload sent as one contiguous DMA burst 
1 30 arrives at the stream buffer 44, a value 31 0 is stored 
in the register 102 as the starting write address, then the 
first word of the first data is written in the address 310 
in the stream buffer 44. When the first word of the first 
data passes the counter 106, the increment controller 
104 increments the value 310 in the register 102 with 
ADDRESSJNCREMENT of value 1. Then the second 
word of the first data is written in the address 311 in the 
stream buffer 44. When the second word of the first data 
passes the counter 106, the increment controller 104 in- 
crements the f tvalue 3M*i?in*the register 102 with 
ADDRESSJNCREMENT of value 1. Then the third 
. word of the first data is written in the address 312 in the 
stream buffer 44. The increment with 
ADDRESSJNCREMENT of value 1 is repeated a 
number of times corresponding to the IP payload size. 
Thus the first data of the TCP/IP payload is written in an 
appropriate area. 

Then the increment controller 104 increments the 
content in the register 102 with 
ADDRESSJNCREMENT of a value corresponding to 
IP header size. Then the writing of second data starts 
from the address according to the content of the register 
1 02. Thus the address translator 54 generates write ad- 
dresses for the payload so that the space for the head- 
ers are left. The last data is likely to have a different size 
than the others. The size of the last data is calculated 
in the increment controller 104 by the following expres- 
sion: 

The last data size = TCP payload size mod IP payload 
size 

Therefore the number of increment is controlled taking 
the last data size into account. In this way, the payload 
sent as one contiguous DMA burst is scattered in the 
shaded areas in the stream buffer 44 shown as Figure 
5A. 

Next, When the TCP header 1 32 is sent as one con- 
tiguous burst over the PCI bus 24, the address translator 
54 generates write addresses corresponding to the pre- 
viously reserved memory locations for the TCP header 
in the stream buffer 44. 

More specifically, before the TCP header sent as 
one contiguous burst 132 arrives at the stream buffer 
44, a value 305 is set in the register 102 as the starting 
write address, whereafter the first word of the TCP head- 



er is written in the address 305 in the stream buffer 44. 
When the first word of the TCP header passes the coun- 
tei 106, the increment controller 104 increments the val- 
ue 305 in the register 102 with 
5 ADDRESSJNCREMENT of value 1 . Then the second 
word of the TCP header is written in the address 306 in 
the stream buffer 44. When the second word of the TCP 

' ^.tteader passes the counter 106, the increment controller 
104 increments the value 306 in the register 102 with 

10 ADDRESSJNCREMENT of value 1. Then the third 
word of the TCP header is written in the address 307 in 
the stream buffer 44. The increment with 
ADDRESSJNCREMENT of value 1 is repeated a 
number of times corresponding to the TCP header size. 

15 Thus the TCP header is written in the shaded area in 
the stream buffer 44 shown as Figure 5B. 

Next, When the IP headers 134 are sent as one con- 
tiguous burst over the PCI bus 24, the address translator 
54 generates write addresses corresponding to the pre- 

20 viously reserved memory locations for the IP headers in 
the stream buffer 44. 

More specifically, before the IP headers sent as one 
contiguous burst 1 34 arrives at the stream buffer 44, a 
■ value 300 is set in the register 102 as the starting write 

25 address, whereafter the first word of the first IP header 
is written in the address 300 in the stream buffer 44. 
When the first word of the first IP header passes the 
counter 106, the increment controller 104 increments 
the value 300 in the register 102 with 

30 ADDRESSJNCREMENT of value 1. Then the second 
word of the first IP header is written in the address 301 
in the stream buffer 44. When the second word of the 
first IP header passes the counter 106, the increment 
controller 104 increments the value 301 in the register 

35 102 with ADDRESSJNCREMENT of value 1. Thenthe 
third word of the first IP header is written in the address 
302 in the stream buffer 44. The increment with 
ADDRESSJNCREMENT of value 1 is repeated a 
number of times corresponding to the IP header size. 

40 Then the increment controller 104 increments the 
content in the register 102 with 
ADDRESSJNCREMENT of a value corresponding to 
TCP header size + IP payload size. Then the writing of 
second IP header starts from the address according to 

45 the content of the register 1 02. Thus the IP headers are 
written in the shaded areas in the stream buffer 44 
shown as Figure 5C. 

Next, the TCP checksum completed by the TCP 
checksum block 56 is copied to the correct location. 

50 in this way, the TCP/IP packetisation is completed 
and can be read from the stream buffer 44 in linear fash- 
ion. 

In the above embodiment, TCP/IP packetisation is 
mentioned. However it is possible to use UDP (User Da- 
55 tagram Protocol) instead of TCP. In this case, the default 
size of UDP header is 2 words (8 bytes). 

In addition, in the above embodiment, the TCP 
header and the IP headers are sent as different bursts 
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from the host 28 to the Tx address translator 54. How- 
ever it is possible to send the TCP header and the IP 
headers together as one contiguous burst from the host 
28 to the Tx address translator 54. 

Tx RAID or SDI block 

In tho sequence of words irTthe stream buffer 44, ' 
parity words may be inserted. This redundancy provides 
a means for correcting errors. The Tx RAID or SDI block 
60 takes in a sequence of N+1 words of which the last 
word is the parity over the N first words. In case it is 
indicated by hardware and/or software, that word M is 
corrupt, e.g., because of a disk failure, the parity word 
is retrieved from the storage device 20 and used to re- 
construct the word M. 

For example, in case of Figure 4, the words 3, 6, 9, 
12, 15, 18 from the failure Disk 2 in the input data 142 
include error shown as Figure 6. The Tx RAID block 60 
reconstruct the word 3 using the words 1 , 2 and the par- 
ity word 0. The Tx RAID block 60 reconstruct the word 
6 using the words 4, 5 and the parity word 1 . Similarly, 
the words 9, 1 2, 1 5, 1 8 are reconstructed by the Tx RAI D 
r-blcck 60.tT^gB^^ rror cor- 

rection and outputs the sequence 1 42 of the words 1 , 2, 
3, 4 ... without errors. 

The RAID function can be turned on/off by the com- 
mand block 66. 

Traffic Shaper 

The traffic shaper 62 consists of two main sections 
one section handles high priority data such as video traf- 
fic, and a low priority section handles general data traf- 
fic. 

The high priority section is organized into several 
traffic classes, in which a class is a group of one or more 
streams having the same bit rate characteristic. For ex- 
ample, all streams of a C BR (Constant Bit Rate) at 2 
Mbps belong to the same class. A class of VBR (Variable 
Bit Rate) type typically contains only one stream, be- 
cause it is unlikely that two VBR streams have identical 
bandwidth patterns at all times. Each class has a single 
set of transmission parameters for controlling the bit 
rate, providing for low CDV (Cell Delay Variation) and 
accurate rate pacing. The number of classes is pro- 
grammable but limited to maximum 128. 

Each class has two main transmission parameters, 
an ideal scheduled time (TS) and an increment (A) for 
the TS. The basic mechanism is that when TS becomes 
equal to or less than a reference clock, a stream pointer 
is put into the transmit queue. At the same time the value 
TS is incremented with the value A. The transmit queue 
is a first in first out queue that will submit the stream 
indicated by the stream pointer the ATM fifo 72 as soon 
as possible. 

In the high priority section a high accuracy bit rate 
and low CDV are achieved following mechanisms. 



Due to the finite resolution of the reference clock, 
having a single A value usually does not give the desired 
accuracy. To achieve the desired accuracy, two A values 
are used alternatively that are just one counter value 

5 apart. These two values result in a rate that is slightly 
above and below the required bit rate. Using each A val- 
ue for different numbers of cells compensates for the 
limited clock resolution and can provide arbitrary accu- 
racy. Ah and A L (where A^Ah+1 ) represent the two dif- 

10 ferent increment values. The N H and N L parameters rep- 
resent the number of cells for which the corresponding 
increment value are alternatively valid. By means of this 
mechanism, the stream is modulated whereby the av- 
erage bit rate approaches the required bit rate within the 

is desired accuracy. Figure 7 shows a behavior of a bit rate 
achieved by this mechanism. In Figure 7, N H cells are 
sent at Ah and N L cells are sent at A L . This sequence is 
repeated cyclically. Thus the average bit rate shown by 
a dotted line is maintained as a long term bit rate. 

20 Low CDV is achieved by reducing the collisions in 
scheduling times of cells from different streams. A major 
cause of collisions in many existing traffic shaping 
mechanisms is that streams of the same bit rate are 
scheduleatGMhe same time. This-is particularly a prob- 
es fern when there is a large number of streams and a low 
number of independent bit rates. This problem is ad- 
dressed in the preferred embodiment by evenly spacing 
the cells of streams belonging to the same class. In other 
words, if the increment for one stream should be A, the 

30 increment for the class is A/n, where n is the number of 
streams in a class. Every time a class is to be serviced, 
the data is taken from successive streams. For example, 
If cells of the stream 0 belonging to Class 0 should be 
incremented by A, each cell of the streams (i.e. stream 

55 o - stream n-1) belonging to the same class 0 is sent 
with the space of A/n shown as Figure 8. 

By the combination of the above two mechanisms, 
a high accuracy bit rate and low CDV are achieved. If 
the transmit queue does not get blocked, the cells are 

40 submitted shown as Figure 9. 

The high priority section also handles VBR traffic. 
The traffic shaper 62 supports smooth update of the 
transmission parameters. This update can be done by 
the host 28 but also by the command block 66. The com- 

45 mand block 66 is programmed by the host 28, and its 
actions are triggered when an exact location is transmit- 
ted from the stream buffer 44. One such action is to re- 
place the transmission parameters for a specified 
stream in the traffic shaper 62. As soon as the data just 

so before a change in bit rate are sent, the command block 
66 updates the parameters. Once it is set-up, this proc- 
ess is autonomous and does not require interaction of 
the host CPU 30 anymore. As a consequence, the host 
28 does not need to interact exactly at the moment in- 

55 teraction would be required. In this way the real-time 
character of the stream is maintained and the host load 
kept to a minimum. 

The low priority section is organized into e.g. fixed 
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32 traffic classes, in which a class is a group of one or 
more streams having the same PGR (Peak Cell Rate). 
In terms of the general data traffic, the real-time con- 
straints are much less significant. The main objective of 
the tinffic shaping of the low priority section is to limit 
the PCR in oider to avoid network policing. The traffic 
shaping of the data packets is implemented by a mech- 
anism using an ideal schedulea 1 time (TS) and an incre- 
ment (A) for the TS, which is similar to the basic traffic 
shaping mechanism in the high priority section. Howev- 
er, scheduling of data packets gets a lower priority than 
real time traffic. Only if the transmit queue of the high 
priority section is empty, a stream of the data packets 
can be submitted to the ATM FIFO 72. 

The mechanism is implemented with an architec- 
ture shown as Figure 10. The traffic shaper 62 consists 
of the high priority section 200 and the low priority sec- 
tion 202 as mentioned above. 

In the high priority section 200, a memory 203 
stores a set of new transmission parameters for each 
class provided from the host 28. Each set of the new 
transmission parameters consists of TS j( A^, A L i, N H i, 
NJ and Ptj (where 0 <, i s 127). In this embodiment Ptj 
' j|paln$*6n^ 
one or more streams attached to the class i. A memory 
206 stores current transmission parameters. When a 
command is instructed by the host 28 or the command 
block 66, an update logic 204 is triggered by the com- 
mand, whereby the current transmission parameters in 
the memory 206 are updated with the new transmission 
parameters in the memory 203. A register 212 stores a 
parameter Nr_Classes indicating the number of class 
from the host 28 at receipt thereof. A traffic logic 208 
checks for each of classes from 0 to Nr_Classes-1 
whether TSj is equal to or less than the current time in- 
dicated by a reference clock 21 0. If so, the stream point- 
er of the first stream attached to this class i is inserted 
to a high priority transmit queue 21 6 and TSj in the mem- 
ory 206 is incremented with A H i or AJ of this class i by 
the traffic logic 208. The A^ and AJ are alternated ac- 
cording to the N H i and NJ. Then the segmentation block 
70 receives the stream pointer from the high priority 
transmit queue 216 and puts a ATM cell belonging to 
the stream indicated by the stream pointer into ATM 
FIFO 72. 

In the low priority section 202, a memory 218 stores 
a set of transmission parameters for each class provid- 
ed from the host 28. In this embodiment each set of the 
transmission parameters consists of TSj, Aj and Ptj 
(where 0 < j <> 31 ). Ptj contains one or more stream point- 
ers which indicate one or more streams attached to the 
class j. A traffic logic 220 checks each of classes from 
0 to 31 if TSj is equal to or less than the current time 
indicated by the reference clock 210 and monitors 
where the high priority transmit queue 216 is empty. If 
so, the stream pointer of the first stream attached to this 
class j is inserted to a low priority transmit queue 222 
and TSj in the memory 21 8 is incremented with Aj of this 



class j by the traffic logic 220. Then the segmentation 
block 70 receives the stream pointer from the low priority 
transmit queue 222 and puts a ATM cell belonging to 
the stream indicated by the stream pointer into ATM 

5 FIFO 72. 

In the above embodiment, a traffic shaping mecha- 
nism being similar to the mechanism of high priority sec- 
tion 200 is applied to the low priority section 202. How- 
ever conventional leaky bucket mechanism may be ap- 

*0 plied to the traffic shaping mechanism of the low priority 
section 202. 

Command Block 

*5 Real time data delivery may sometimes involve ac- 
tions occurring at specific locations in the outgoing data 
stream. These actions must be immediate in order to 
maintain the integrity of the stream. Due to the many 
responsibilities of the host 28, timely interaction cannot 

20 always guaranteed. In the preferred embodiment, it is 
the responsibility of the command block 66 to perform 
these interactions. In principle, the host 28 knows ex- 
actly where in the outgoing data stream the streaming 
- *-paraftie%W;need, stream 

25 buffer 44 is allocated statically as mentioned above, it 
is possible to express a location, where the actions 
should be taken, in read pointer of the stream buffer 44. 
The host 28 loads a list of the command block 66 with 
a number of instructions at the appropriate moment in 

30 time. The appropriate time is the time between the load- 
ing of the outgoing data stream to the stream buffer 44 
and the time that the outgoing data stream is sent out 
from the stream buffer 44. The command block 66 scans 
the read pointer of the stream buffer 44. If a match with 

35 a specified address is found, a command that is linked 
to that address will be executed and the command will 
be purged from the command block 66. 

The command block 66 triggers on the address of 
the data leaving the stream buffer 44. When reading a 

40 stream buffer 44, the read pointer is gradually incre- 
mented with a wrap-around. Each stream has a linked 
list that contain the (address, command) pairs to be 
stored according to the address. An address is a pair (L, 
M) indicating the address in the file and is independent 

45 from the physical address. L is the number of blocks, 
with a block size equal to the size of the stream buffer 
44. M is sequence number in the last block. Each stream 
maintains a WAC (Wrap-around counter) that counts the 
number of times the read pointer has been wrapped 

50 around. An address match is found if 

L=WAC and M=Read Pointer - Buffer Offset 

55 This mechanism is implemented as follows. Figure 
11 shows a block diagram of the command block 66. 
The command block 66 consists of several command 
generators 300. Each command generator 300 handles 
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the commands for each outgoing data stream. The host 
28 loads a list of commands in the command register 
316 in each command generator 300 at the appropriate 
moment in time. 

In a command generator 300, a regislef 302 stores 5 
the Buffet Offset. A comparator 304 compares the Buffer 
Offset in the register 302 with the read pointer of the 
stream buffer 44! When a wrap-around occurs, the read- » 
pointer takes the Buffer Offset. Therefore when the 
match is detected by the comparator 304, the WAC io 
(Wrap-Around Counter) 306 is incremented. The com- 
parator 308 compares the count of the WAC 306 with 
current L provided from the command register 316. A 
comparator 310 compares current M provided from the 
command register 316 with the read pointer - Buffer Off- 
set. When the matches are detected by the comparator 
308 and the comparator 310, the AND gate 312 de- 
queues a current command stored by a queue 314. 
Each time a current command corresponding to a cur- 
rent address (L, M) is output from the queue 31 4, a com- 
mand corresponding to a next address is queued in the 
queue 314 from the command register 316. Thus each 
command generator 300 instructs commands according 
^g^adtpointer of the strfjamibi^an 

The commands to be instructed from the command 
block 66 are: 

Change bit rate: This command will allow to change a 
stream bandwidth. When this command is instructed, 
the traffic shaper 62 detach a stream from its current 
class, updates the A values for the current class, attach 
the stream to a new class and update the linked A values 
of the new class. Thus the bit rate of individual steams 
is changed at specific stream locations. This is useful 
for MPEG bit stream of VBR (variable bit rate), for ex- 
ample. 

Insert RCI: This command allows to insert an RCI 
(Rate Change Indicator) at specific location in the 
stream. The RCI is able to notify the distant terminal (e. 
g., STB 18) the rate changes at that moment and aids 
clock recovery for MPEG decoders. The detail of the 
RCI is described as "data rate data" in the European 
Patent Application EP 0 712 250 A2. When this com- 
mand is instructed, the RCI generator 68 generates the 
RCI and the segmentation block 70 terminates the cur- 
rent segmentation and a separate AAL-5 PDU (one ATM 
cell) for the RCI is generated. This is useful for MPEG 
bit stream of VBR. 

Enable RAID: This command set the appropriate 
parameters in the Tx RAID block 60 for the error correc- 
tion. 

Disable RAID: This function is the inverse of the 
above Enable RAID. 

Perform Byte Swap: This command allows to cope 
with little endian / big endian problems between the 
server 10 and the STB 18. When this command is in- 
structed, the byte swapper 64 reorders the bytes within 
a word 350 in the outgoing stream in an order of a word 
352 shown as Figure 12. 



Enable different PDU-Size: TCP can require seg- 
mentation. One TCP packet is to be divided in different 
IP-packets. The last IP packet requires usually a differ- 
ent AAL-5 PDU size than the previous one. When this 
command is instructed, the segmentation block 70 
change the AAL-5 PDU size. 

Interrupt CPU: This is the most general function. It 
-requests the host CPU 30 interaction upon detection of 
a certain location in the stream. 

VPI/VCI Filtering Block 



Figure 13 shows a format of one ATM cell used in 
UNI (User Network Interface). One ATM cell consists of 
is 53 bytes. First 5 bytes constitute a ATM header and the 
remaining 48 bytes carry pay load. The first 4 bits in the 
ATM header is called GFC (Generic Flow Contral). The 
following 24 bits in the ATM header is called VPI/VCI. 
Actually, the VPI/VCI consists of VPI of 8 bits and VCI 
20 of 16 bits. The following 3 bits in the ATM header is 
called PT (Payload Type). The following 1 bit in the ATM 
header is called CLP (Cell Loss Priority). The last 8 bits 
in the ATM header is called HEC (Header Error Control). 
^ -«*aRie VPI/VCI filtering- b1ocki84 receives suGh.ATM^fe&V"-" 
25 from ATM FIFO 82. 

The VPI/VCI filtering block 84 determines whether 
a VPI/VCI of the received ATM cell is an element of the 
set of VPI/VCIs that should be accepted, determines to 
which stream the ATM cell belongs, and filters OAM 
30 (Operation, Administration and Maintenance) F5 cells. 
To achieve this filtering process, a VPI/VCI translation 
from a VPI/VCI to an internal stream ID is performed in 
a VPI/VCI translator 85 in the VPI/VCI filtering block B4. 
The object of the VPI/VCI translation mechanism is 
35 to allow as wide a range of legal VPI/VCIs as possible, 
while at the same time facilitating fast translation. Pref- 
erably, all VPI/VCIs should be admissable. The VPI/VCI 
translation can be done using conventional binary 
search techniques. However, due to time constraints, 
40 the largest acceptable search is of the order of a binary 
search of 512 entries. On the other hand, the maximum 
number of active VPI/VCIs should be greater than 512 
to support simultaneous communications with a large 
number of clients. 
45 in order to meet the object, the VPI/VCI table is di- 
vided up into sections of 512 entries. Each entry indi- 
cates a relation between a VPI/VCI and an internal 
stream ID is entered into a certain section depending on 
a distribution mechanism and within each section the 
so entries are ordered. 

Upon reception of a ATM cell, once the correct sec- 
tion has been found, a binary search can be performed 
over that section to find the correct entry. Therefore the 
distribution mechanism to distribute the VPI/VCIs must 
55 allow immediate indexing into a section according to the 
VPI/VCI. Moreover, to allow for efficient use of the VPI/ 
VCI table, the mechanism must allow for a wide distri- 
bution of the VPI/VCIs. In other words, the mechanism 



9 



17 



EP0 847 216A1 



must distribute the entries as randomly as possible over 
the entire VPI/VCI table. If a VPI/VCI maps intoa section 
of the VPI/VCI table that is already full, it must be reject- 
ed even though there may be space in other sections. 

One distribution mechanism that lils la the require- 
ments is to simply use the lower X (where X is integer; 
e.g., 3) bits of the VC! as hash key to index into the VPI/ 
VCI table. It is reasonable thaf whefrthsre^ei re *a large 
number of active VP/VCs the lower bits will be the most 
random of the 24 bits VPI/VCI field and allow for an even 
distribution. 

Using this type of mechanism, the requirements of 
fast look up and no illegal or inadmissable VPI/VCI s are 
met. The mechanism is implemented as follows. 

Figure 14 shows a block diagram of the VPI/VCI 
translator 85. When a new VP/VC become active, a new 
entry indicating a VPI/VCI of that new VP/VC and an 
internal stream ID corresponding to that VPI/VCI is en- 
tered into a section according to the lower 3 bits of the 
VCI (i.e., bits 7, 6, 5 of 4-th byte in Figure 1 3) via a hash 
function 400. More specifically, if the lower 3 bits of the 
VCI is 000, the entry is stored in the section 1 in the VPI/ 
VCI table 402. If the lower 3 bits of the VCI is 001, the 
^tfy;is«tored in the section 2 in theWPIA^CI table 402.* 
If the lower 3 bits of the VCI is 010, the entry is stored 
in the section 3 in the VPI/VCI table 402. Similarly, all 
new entries are stored in appropriate sections according 
to the lower 3 bits of the VCI. Thus, the VPI/VCI table 
402 of e.g. 4096 entries is divided up into 8 sections 
(section 1 - 8) of e.g. 512 entries. Within each section 
the entries are reordered in an ascending or descending 
order to implement a binary search. 

Upon reception of an ATM cell, the VPI/VCI of the 
received ATM cell is provided to a search engine 420 
and the hash function 400. The hash function 400 pro- 
vides a section index based on the lower 3 bits of the 
VPI/VCI to the search engine 420. Then a binary search 
is performed by the search engine 420 over a section 
corresponding to the section index to find the correct en- 
try. For example, if the lower 3 bits of the VCI of the re- 
ceived ATM cell is 01 0, the hash function 400 provides 
3 as the section index to the search engine 420. Then, 
the search engine 420 performs a binary search over 
the section 3 to find a correct entry and outputs a internal 
stream ID of the found entry. If the lower 3 bits of the 
VCI of the received ATM cell is 111, the hash function 
400 provides 8 as the section index to the search engine 
420. Then, the search engine 420 performs a binary 
search over the section 8 to find a correct entry and out- 
puts a internal stream ID of the found entry. The output 
internal stream ID is used for the filtering process. 

In the above embodiment, the lower 3 bits of the 
VPI/VCI field is simply used as a section index. Howev- 
er, a more complex hash function may be used over the 
VPI/VCI field to generate a section index. 

In the above embodiment, when a new VP/VC be- 
comes active, the new entry is entered to an appropriate 
section via the hash function 400. However, it is possible 



to create a new VPI/VCI table including the new entry 
in the host 28 having a hash function of the same mech- 
anism as the hash function 400, transfer the new VPI/ 
VCI table to the VPI/VCI translator 85 and update the 
5 VPI/VCI table 402 with the new VPI/VCI table. 

Pattern Detector 

The host 28 knows what kind of data incoming in 

10 over a specific VC. The host 28 instruct the patten de- 
tector 92, per VC, which pattern is to be scanned for. 
The purpose of the pattern detector 92 is to detect a 
preset bit pattern in the incoming data stream. Each time 
a match is detected, the pattern detector 92 informs the 

is host 28 the "data detected" state. When the host 28 re- 
ceives the information of the detection, it adds the ad- 
dress at which it occurs to a list in the host memory 32. 
As the detection itself is done automatically, the host 28 
can perform other jobs in the mean time. The host 28 

20 only needs to be interrupted in case the pre-set bit pat- 
tern is detected and the action can be taken. 

Figure 1 5 shows a block diagram of the pattern de- 
tector 92. Before the incoming data stream is transmit- 
■ -4 ted through the reception ^pathjfl^Sftfe host 28 instructs 

25 the pattern detect controller 506, per VC t which pattern 
is to be scanned for. The pattern detect controller 506 
can set 4 pre-programmed bit pattern of 32 bits wide in 
register 504 for each stream. The alignment circuit 500 
performs byte alignment of incoming data stream. The 

30 matching circuit 502 performs byte aligned matching 
against 4 pre-programmed bit patterns per stream. Each 
time the match is detected, the matching circuit 502 in- 
forms-the controller 506 of the detection. 

An example of the purpose of pattern detector 92 is 

3S to find locations of l-picture in video bit stream com- 
pressed according to the MPEG standard. In the MPEG 
bit stream, a picture immediately following GOP header 
is always l-picture. Therefore, It is possible to find a lo- 
cation of l-picture by detecting group_start_code (32 

40 bits) identifying the beginning of GOP header and 
picture_start_code (32 bits) identifying the beginning of 
picture header. 

For instance, when a MPEG bit stream of a movie 
is transferred from another SMU 1 2 in order to duplicate 

45 the movie, group_start_code and picture_start_code 
are set in the register 504 as pre-set bit patterns. The 
pattern detector 92 detects group_start_code and 
picture_start_code in the received MPEG bit stream. 
Each time picture_start_code is detected immediately 

50 after the detection of group_start_code in the matching 
circuit 502, the pattern detect controller 506 informs the 
detection state to the host CPU 30. The host CPU 30 
adds an address of storage device 20 in which the I- 
picture is stored to a list in the host memory 32. Thus 

55 the list indicting locations of l-picture is constructed dur- 
ing the MPEG bit stream flows in the reception path 80. 

The list is used for VCR-operation when the stored 
MPEG bit stream is transferred to the STB 18. If the STB 
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18 requests VCR operation (e.g., FF, FR), the host 28 
refers this list and instructs the storage device controller 
22 to access and retrieve the l-pictures. 

Using this feature, the data stored in the storage de- 
vice 20 is "raw" or not formatled for a specific applica- 5 
tion. This increases the "application independence" and 
interoperability of server system 1 0 (Figure 1 ). 

Rx Address Translator 



The purpose of the Rx address translator 96 is to 
gather different (non-contiguous) words from a stream 
buffer 46 and to create a burst data to the PCI bus 24. 
It is basically the inverse function of the address trans- 
lation of the Tx address translator 54. The difference is is 2. 
that in this case a dynamic buffer structure must be con- 
sidered. The burst data is transferred to the storage de- 
vice 20 or the host 28 via the PCI bus 24. 

Figure 16 shows an example of an address trans- 
lation applied to a incoming data stream to be stored in 20 
Disk 0, 1, 2 and 3 of the storage device 20. In this ex- 
ample, the following sequence of read addresses for the 
stream buffer 48 is generated by the Rx address trans- 
lator 96'to create'a'bum^8t^5^te^'».. > ■ <• 

25 



(d) a reception unit for transmitting incoming 
data from the ATM interface to the bus inter- 
face, the reception unit including 

(1) means for performing VPI/VCI filtering 
of incoming ATM cells, 

(2) means for reassembling the incoming 
''<tetB'Us!irig< pay load of the incoming ATM 

cells, and 

i.v-n. (3) a second RAM interface for interfacing 
RAM being used as a buffer for buffering 
the incoming data the means for reassem- 
bling. 

An apparatus according to claim 1, wherein the 
transmission unit comprises a command block for 
storing at least a pair of an address and a command, 
the address and the command being loaded from 
the host, and detecting a match between the ad- 
dress and an address specified by a read pointer of 
the buffer, the command being executed when the 
match is detected. 



178, 182, 13, 17, 1099, 1103 

179, 183, 14, 18, 1100, 1104 

180, 184, 15, 19, 1101, 1105 

181, 185, 16... 



(for Disk 0) 
(for Disk 1) 
(for Disk 2) 
(for Disk 3) 



Claims 

1. An apparatus for sending data to an ATM network 
and receiving data from the ATM network compris- 
ing: 

(a) a bus interface for interfacing with a bus 
supporting communication between a host, a 
storage device and the apparatus; 

(b) an ATM interface for interfacing with the 
ATM network; 

(c) a transmission unit for transmitting outgoing 
data from the bus interface to the ATM inter- 
face, the transmission unit including 

(1 ) a first RAM interface for interfacing with 
RAM being used as a buffer for buffering 
the outgoing data from the bus interface, 

(2) means for segmenting the outgoing da- 
ta from the buffer into outgoing ATM cells, 
and 

(3) a traffic ehaper, for controlling traffic of 
the outgoing ATM cells to the ATM interface 
in cooperation with the means for segment- 
ing; and 
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plus aeGUraVQI Ppiim'2v' wherein the traf- 
fic shaper changes a stream bandwidth of the out- 
going ATM cells according to the command. 

An apparatus according to claim 2 or 3, wherein the 
transmission unit comprises the means for inserting 
a Rate Change Indicator into the outgoing data read 
from the buffer according to the command. 

An apparatus according to claim 2, 3 wherein the or 
4 command causes an interrupt of the host. 

An apparatus according to any of claims 2-5, where- 
in the transmission unit includes means for swap- 
ping bytes of the outgoing data read from the buffer 
according to the command. 

An apparatus according to any of claims 2-6, where- 
in the means for segmenting applies AAL segmen- 
tation with PDUs to the outgoing data read from the 
buffer and changes the PDU-size according to the 
command. 

An apparatus according to any of claims 1 -7, where- 
in the transmission unit includes an address trans- 
lator for providing write addresses for a burst data 
from the storage device, at least a portion of the 
write addresses being non-contiguous, and the 
buffer stores the burst data according to the write 
addresses and outputs data therefrom in a linear 
fashion. 

An apparatus according to claim 8, wherein the 
burst data includes payload for a predetermined 
protocol, and the address translator receives pack- 
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etisation parameters including at least size of the 
pay load from the host and generates the write ad- 
dresses so that a space for a protocol header of the 
predetermined protocol is reserved in the buffer ac- 
cording lo Iho packelisalion parameters, and Ihe 
buffer receives the protocol header in a burst from 
the host via the bus and stores the protocol header 



wherein the increment takes a val ue so that the cells 
of the streams belonging to the same class are 
evenly spaced. 



10. An apparatus. according 4o,eJajm* 9, ^ wherein the 
packetisation parameters include size of the proto- 
col header. 

11. An apparatus according to claim 9 or 10, wherein 
the predetermined protocol is TCP(or UDP) and/or 
IP. 

12. An apparatus according to claim 9, 10 or 11, where- 
in the transmission unit comprises means for re- 
ceiving a partial checksum calculated by the host, 
updating the partial checksum to obtain a complete 
checksum and filling the complete checksum into 
the protocol header in the buffer when the protocol 

+ - 'header and>the'paylead«have>ti&W 
buffer. 



17. An apparatus, according lo any ol claims 13-16, 
wherein the first streams have high priority, and the 
traffic shaper comprises means for shaping traffic 

of the ATM celts 
having low priority with leaky bucket mechanism. 



13. 



1-12, 



14. 



15. 
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An apparatus according to any of claims 
wherein the traffic shaper comprises: 

means for classifying one or more first 
streams of the ATM cells into one or more classes, 
each class including one or more streams having 
the same bit rate characteristics; storage means for 
storing a set of parameters to control the bit rate for 
each class; and means for executing a rate pacing 
of each class according to the set of parameters in 
the storage means. 

An apparatus according to claim 13, wherein the pa- 
rameters include an ideal scheduled time and an 
increment for the ideal scheduled time, and the 
means for executing increments the ideal sched- 
uled time with the increment when a ATM cell of a 
current stream in a class is put into a transmit 
queue, compares a reference clock with the incre- 
mented ideal scheduled time and puts a ATM cell 
of a next stream in the class into the transmit queue 
when the incremented ideal scheduled time be- 
comes equal to or less than the reference clock. 

An apparatus according to claim 14, wherein the in- 
crement of each class takes two alternative values, 
one of the values indicating a bit rate higher than a 
required bit rate and the other indicating a bit rate 
lower than the required bit rate, and the means for 
executing alternates the values of the increment cy- 
clically. 



16. An apparatus according to claim 13, 14 or 15, 



18. An apparatus according to any of claims 13-17, 
wherein the first streams incl ude a VBR stream, and 
when a bit rate characteristic of the VBR stream 
changes, the means for classifying detaches the 
VBR stream from a current class and attaches the 
VBR stream to a new class, and the parameters of 
the current class and the new class are updated. 

19. An apparatus according to any of claims 1-18, 
wherein the means for performing VPI/VCI filtering 
comprises: 



a table for storing VPI/VCI entries and being di- 
>Wt%*»*^*i'rMde&\n\o sections^ 

means for distributing VPIA/CI entries into the 
sections in the table according to a portion of 
each VPI/VCI entry; 

means for selecting a section to be searched 
according to the portion of a VPI/VCI of a re- 
ceived ATM cell; and means for performing a 
search over the selected section to find an entry 
corresponding to the VPI/VCI of the received 
ATM cell and outputting an internal ID corre- 
sponding to the found entry. 



25 
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20. An apparatus according to claim 19, wherein the 
portion comprise the lower X bits of the VCI. 

21. An apparatus according to claim 19 or 20, wherein 
the entries are stored in an ordered way within each 
section and the search is a binary search. 

22. An apparatus according to any of claims 24-28, 
wherein the reception unit includes means for veri- 
fying a checksum in a protocol header of the incom- 
ing data. 

23. An apparatus according to any of claims 1-22, 
wherein the reception unit includes an address 
translator for gathering non-contiguous words from 
the buffer to create a bu rst data for the bus interface. 

24. A system comprising an apparatus according to any 
of claims 1 -23, the host for running applications and 
managing delivery of data, the storage device for 
storing the incoming data and retrieving the stored 
incoming data as the outgoing data, and the bus. 



12 



23 EP 0 847 216 A1 24 

25. System according to claim 24, wherein the storage 
device includes a RAID or RDI system, and the 
transmission unit includes a RAID function for cor- 
recting error in the outgoing data using RAID redun- 
dancy and changing whether performing an error 5 
correction or not according to the command. 

*fc^^te^&cc^^ - ' ■ * '* ?M*- 

storage device includes a RAID or RDI system, and 
adjacent words in.the bur^Jdata belong to different io 
stripe units. 

27. A system according to claim 24, 25 or 26, wherein 
the reception unit includes: 

75 

a pattern detector for detecting at least a preset 
bit pattern in the incoming data when the incom- 
ing data is transmitted from the ATM interface 
to the bus interface; and 

a list for storing location information corre- 20 
sponding to a location of the preset bit pattern 
in the data when the present bit pattern is de- 
tected by the pattern detector; 
i * ;£**^th^ data-from* .» 

the storage device to the network according to 25 
the location information in the list. 

28. A system according to claim 27, wherein the incom- 
ing data includes video data compressed according 

to MPEG standard and the location information rep- 30 
resents at least a location of I -picture, and the host 
controls the storage device so that the I -picture is 
accessed according to the location information. 

29. An apparatus according to any of claims 24-28, 35 
wherein the storage device includes a RAI D or RDI 
system, and the reception unit includes means for 
adding RAID redundancy to the incoming data. 

30. A method for sending/receiving data to/from an 40 
ATM network, using an apparatus according to any 

of claims 1 -23 and/or a system according to any of 
claims 24-29. 

45 
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